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(D) SOFTWARE: Patent In Release #1.0, Version #1.3 0 
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(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/153,3 97 

(B) FILING DATE: 16 -NOV- 1993 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Coruzzi, Laura A. 

(B) REGISTRATION NUMBER: 30,742 

(C) REFERENCE/DOCKET NUMBER: 7683-031 

(ix) TELECOMMUNICATION INFORMATION: 
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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3962 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



(ix) FEATURE : 
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(A) NAME/KEY: CDS 

(B) LOCATION: 321.. 3077 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CGGGCCTGAG ACTGGGGTGA CTGGGACCTA AGAGAATCCT GAGCTGGAGG CCCCCGACAG 

CTGCTCTCGG GAGCCGCCTC CCGACACCCG AGCCCCGCCG GCGCCTCCCG CTCCCGGCTC 

CCGGCTCCTG GCTCCCTCCG CCTCCCCCGC CCCTCGCCCC GCCGCCGAAG AGGCCCCGCT 

CCCGGGTCGG ACGCCTGGGT CTGC CGGGAA GAGCGATGAG AGGTGTCTGA AGGTGGCTAT 

TCACTGAGCG ATGGGGTTGG ACTTGAAGGA ATGCCAAGAG ATGCTGCCCC CACCCCCTTA 

GGCCCGAGGG ATCAGGAGCT ATG GGA CCA GAG GCC CTG TCA TCT TTA CTG 

Met Gly Pro Glu Ala Leu Ser Ser Leu Leu 
15 10 

CTG CTG CTC TTG GTG GCA AGT GGA GAT GCT GAC ATG AAG GGA CAT TTT 
Leu Leu Leu Leu Val Ala Ser Gly Asp Ala Asp Met Lys Gly His Phe 
15 20 25 

GAT CCT GCC AAG TGC CGC TAT GCC CTG GGC ATG CAG GAC CGG ACC ATC 
Asp Pro Ala Lys Cys Arg Tyr Ala Leu Gly Met Gin Asp Arg Thr He 
30 35 40 

CCA GAC AGT GAC ATC TCT GCT TCC AGC TCC TGG TCA GAT TCC ACT GCC 
Pro Asp Ser Asp He Ser Ala Ser Ser Ser Trp Ser Asp Ser Thr Ala 
45 50 55 

GCC CGC CAC AGC AGG TTG GAG AGC AGT GAC GGG GAT GGG GCC TGG TGC 
Ala Arg His Ser Arg Leu Glu Ser Ser Asp Gly Asp Gly Ala Trp Cys 
60 65 70 

CCC GCA GGG TCG GTG TTT CCC AAG GAG GAG GAG TAC TTG CAG GTG GAT 
Pro Ala Gly Ser Val Phe Pro Lys Glu Glu Glu Tyr Leu Gin Val Asp 
75 80 85 90 

CTA CAA CGA CTC CAC CTG GTG GCT CTG GTG GGC ACC CAG GGA CGG CAT 
Leu Gin Arg Leu His Leu Val Ala Leu Val Gly Thr Gin Gly Arg His 
95 100 105 

GCC GGG GGC CTG GGC AAG GAG TTC TCC CGG AGC TAC CGG CTG CGT TAC 
Ala Gly Gly Leu Gly Lys Glu Phe Ser Arg Ser Tyr Arg Leu Arg Tyr 
HO 115 120 

TCC CGG GAT GGT CGC CGC TGG ATG GGC TGG AAG GAC CGC TGG GGT CAG 
Ser Arg Asp Gly Arg Arg Trp Met Gly Trp Lys Asp Arg Trp Gly Gin 
125 130 135 

GAG GTG ATC TCA GGC AAT GAG GAC CCT GAG GGA GTG GTG CTG AAG GAC 
Glu Val He Ser Gly Asn Glu Asp Pro Glu Gly Val Val Leu Lys Asp 
140 145 150 
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I.n 



■3 # 



CTT GGG CCC CCC ATG GTT GCC CGA CTG GTT CGC TTC TAC CCC CGG GCT 830 
Leu Gly Pro Pro Met Val Ala Arg Leu Val Arg Phe Tyr Pro Arg Ala 
155 160 165 170 

GAC CGG GTC ATG AGT GTC TGT CTG CGG GTA GAG CTC TAT GGC TGC CTC 8 78 

Asp Arg Val Met Ser Val Cys Leu Arg Val Glu Leu Tyr Gly Cys Leu 
175 180 185 

TGG AGG GAT GGA CTC CTG TCT TAC ACC GCC CCT GTG GGG CAG ACA ATG 92 6 

Trp Arg Asp Gly Leu Leu Ser Tyr Thr Ala Pro Val Gly Gin Thr Met 
190 195 200 

TAT TTA TCT GAG GCC GTG TAC CTC AAC GAC TCC ACC TAT GAC GGA CAT 974 
Tyr Leu Ser Glu Ala Val Tyr Leu Asn Asp Ser Thr Tyr Asp Gly His 
205 210 215 

ACC GTG GGC GGA CTG CAG TAT GGG GGT CTG GGC CAG CTG GCA GAT GGT 1022 
Thr Val Gly Gly Leu Gin Tyr Gly Gly Leu Gly Gin Leu Ala Asp Gly 
220 225 230 

GTG GTG GGG CTG GAT GAC TTT AGG AAG AGT CAG GAG CTG CGG GTC TGG 1070 
Val Val Gly Leu Asp Asp Phe Arg Lys Ser Gin Glu Leu Arg Val Trp 
235 240 245 250 



CCA GGC TAT GAC TAT GTG GGA TGG AGC AAC CAC AGC TTC TCC AGT GGC 1118 
Pro Gly Tyr Asp Tyr Val Gly Trp Ser Asn His Ser Phe Ser Ser Gly 
Iff 255 260 265 



|,* TAT GTG GAG ATG GAG TTT GAG TTT GAC CGG CTG AGG GCC TTC CAG GCT 1166 

|;p Tyr Val Glu Met Glu Phe Glu Phe Asp Arg Leu Arg Ala Phe Gin Ala 

\S 270 275 280 



ATG CAG GTC CAC TGT AAC AAC ATG CAC ACG CTG GGA GCC CGT CTG CCT 1214 
Met Gin Val His Cys Asn Asn Met His Thr Leu Gly Ala Arg Leu Pro 
285 290 295 

GGC GGG GTG GAA TGT CGC TTC CGG CGT GGC CCT GCC ATG GCC TGG GAG 1262 
Gly Gly Val Glu Cys Arg Phe Arg Arg Gly Pro Ala Met Ala Trp Glu 
300 305 310 

GGG GAG CCC ATG CGC CAC AAC CTA GGG GGC AAC CTG GGG GAC CCC AGA 1310 
Gly Glu Pro Met Arg His Asn Leu Gly Gly Asn Leu Gly Asp Pro Arg 
3X5 320 325 330 

GCC CGG GCT GTC TCA GTG CCC CTT GGC GGC CGT GTG GCT CGC TTT CTG 13 5 8 

Ala Arg Ala Val Ser Val Pro Leu Gly Gly Arg Val Ala Arg Phe Leu 
335 340 345 

CAG TGC CGC TTC CTC TTT GCG GGG CCC TGG TTA CTC TTC AGC GAA ATC 14 06 

Gin Cys Arg Phe Leu Phe Ala Gly Pro Trp Leu Leu Phe Ser Glu lie 
350 355 360 

TCC TTC ATC TCT GAT GTG GTG AAC AAT TCC TCT CCG GCA CTG GGA GGC 14 54 

Ser Phe lie Ser Asp Val Val Asn Asn Ser Ser Pro Ala Leu Gly Gly 
365 370 375 
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ACC TTC CCG CCA GCC CCC TOG TGG CCQ CCT GGC CCA CCT CCC ACC AAC 
Thr Phe Pro Pro Ala Pro Trp Trp Pro Pro Gly Pro Pro Pro Thr Asn 
380 385 390 

TTC AGC AGC TTG GAG CTG GAG CCC AGA GGC CAG CAG CCC GTG GCC AAG 
Phe Ser Ser Leu Glu Leu Glu Pro Arg Gly Gin Gin Pro Val Ala Lys 
395 400 405 4 i 0 

GCC GAG GGG AGC CCG ACC GCC ATC CTC ATC GGC TGC CTG GTG GCC ATC 
Ala Glu Gly Ser Pro Thr Ala lie Leu He Gly Cys Leu Val Ala He 
415 420 425 

ATC CTG CTC CTG CTG CTC ATC ATT GCC CTC ATG CTC TGG CGG CTG CAC 
He Leu Leu Leu Leu Leu He He Ala Leu Met Leu Trp Arg Leu His 
430 435 440 

TGG CGC AGG CTC CTC AGC AAG GCT GAA CGG AGG GTG TTG GAA GAG GAG 
Trp Arg Arg Leu Leu Ser Lys Ala Glu Arg Arg Val Leu Glu Glu Glu 
445 450 455 

CTG ACG GTT CAC CTC TCT GTC CCT GGG GAC ACT ATC CTC ATC AAC AAC 
Leu Thr Val His Leu Ser Val Pro Gly Asp Thr He Leu He Asn Asn 
460 465 470 

CGC CCA GGT CCT AGA GAG CCA CCC CCG TAC CAG GAG CCC CGG CCT CGT 
Arg Pro Gly Pro Arg Glu Pro Pro Pro Tyr Gin Glu Pro ArqPro Arc 
475 4 80 485 490 

GGG AAT CCG CCC CAC TCC GCT CCC TGT GTC CCC AAT GGC TCT GCG TTG 
Gly Asn Pro Pro His Ser Ala Pro Cys Val Pro Asn Gly Ser Ala Leu 
495 500 505 

CTG CTC TCC AAT CCA GCC TAC CGC CTC CTT CTG GCC ACT TAC GCC CGT 
Leu Leu Ser Asn Pro Ala Tyr Arg Leu Leu Leu Ala Thr Tyr Ala Arg 
510 515 520 

CCC CCT CGA GGC CCG GGC CCC CCC ACA CCC GCC TGG GCC AAA CCC ACC 
Pro Pro Arg Gly Pro Gly Pro Pro Thr Pro Ala Trp Ala Lys Pro Thr 



S 525 530 



535 



AAC ACC CAG GCC TAC AGT GGG GAC TAT ATG GAG CCT GAG AAG CCA GGC 
Asn Thr Gin Ala Tyr Ser Gly Asp Tyr Met Glu Pro Glu Lys Pro Glv 
540 545 5 5o 

GCC CCG CTT CTG CCC CCA CCT CCC CAG AAC AGC GTC CCC CAT TAT GCC 
Ala Pro Leu Leu Pro Pro Pro Pro Gin Asn Ser Val Pro His Tyr Ala 
555 560 565 570 

GAG GCT GAC ATT GTT ACC CTG CAG GGC GTC ACC GGG GGC AAC ACC TAT 
Glu Ala Asp He Val Thr Leu Gin Gly Val Thr Gly Gly Asn Thr Tyr 
575 580 585 

GCT GTG CCT GCA CTG CCC CCA GGG GCA GTC GGG GAT GGG CCC CCC AGA 
Ala Val Pro Ala Leu Pro Pro Gly Ala Val Gly Asp Gly Pro Pro Arg 
590 595 60Q 



1502 



1550 



1598 



1646 



1694 



1742 



1790 



1838 



1886 



1934 



1982 



2030 



2078 



2126 
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a 



GTG GAT TTC CCT CGA TCT CGA CTC CGC TTC AAG GAG AAG CTT GGC GAG 2174 
Val Asp Phe Pro Arg Ser Arg Leu Arg Phe Lys Glu Lys Leu Gly Glu 
605 610 615 

GGC CAG TTT GGG GAG GTG CAC CTG TGT GAG GTC GAC AGC CCT CAA GAT 2222 
Gly Gin Phe Gly Glu Val His Leu Cys Glu Val Asp Ser Pro Gin Asp 
620 625 630 

CTG GTC AGT CTT GAT TTC CCC CTT AAT GTG CGT AAG GGA CAC CCT TTG 227 0 

Leu Val Ser Leu Asp Phe Pro Leu Asn Val Arg Lys Gly His Pro Leu 
635 640 645 650 

CTG GTA GCT GTC AAG ATC TTA CGG CCA GAT GCC ACC AAG AAT GCC AGC 2 318 

Leu Val Ala Val Lys lie Leu Arg Pro Asp Ala Thr Lys Asn Ala Ser 
655 660 665 

TTC TCC TTG TTC TCC AGG AAT GAT TTC CTG AAA GAG GTG AAG ATC ATG 2 3 66 

Phe Ser Leu Phe Ser Arg Asn Asp Phe Leu Lys Glu Val Lys lie Met 
670 675 680 

TCG AGG CTC AAG GAC CCC AAC ATC ATT CGG CTG CTG GGC GTG TGT GTG 2414 
Ser Arg Leu Lys Asp Pro Asn lie lie Arg Leu Leu Gly Val Cys Val 
685 690 695 

CAG GAC GAC CCC CTC TGC ATG ATT ACT GAC TAC ATG GAG AAC GGC GAC 24 62 

Gin Asp Asp Pro Leu Cys Met lie Thr Asp Tyr Met Glu Asn Gly Asp 



i;i 5 _ _ 

|n 700 705 " " 710 



CTC AAC CAG TTC CTC AGT GCC CAC CAG CTG GAG GAC AAG GCA GCC GAG 2510 
Leu Asn Gin Phe Leu Ser Ala His Gin Leu Glu Asp Lys Ala Ala Glu 
715 720 725 730 



000 GCC CCT 000 GAC 000 CAG GCT GCG GAG GGG CCC ACC ATC AGC TAC 2558 
W Gly Ala Pro Gly Asp Gly Gin Ala Ala Gin Gly Pro Thr He Ser Tyr 

# 735 740 745 



CCA ATG CTG CTG CAT GTG GCA GCC CAG ATC GCC TCC GGC ATG CGC TAT 2 606 

Pro Met Leu Leu His Val Ala Ala Gin He Ala Ser Gly Met Arg Tyr 
750 755 760 

CTG GCC ACA CTC AAC TTT GTA CAT CGG GAC CTG GCC ACG CGG AAC TGC 2654 
Leu Ala Thr Leu Asn Phe Val His Arg Asp Leu Ala Thr Arg Asn Cys 
765 770 775 

CTA GTT GGG GAA AAT TTC ACC ATC AAA ATC GCA GAC TTT GGC ATG AGC 2702 
Leu Val Gly Glu Asn Phe Thr He Lys He Ala Asp Phe Gly Met Ser 
780 785 790 

CGG AAC CTC TAT GCT GGG GAC TAT TAC CGT GTG CAG GGC CGG GCA GTG 2 750 

Arg Asn Leu Tyr Ala Gly Asp Tyr Tyr Arg Val Gin Gly Arg Ala Val 
795 800 805 810 

CTG CCC ATC CGC TGG ATG GCC TGG GAG TGC ATC CTC ATG GGG AAG TTC 2 798 

Leu Pro He Arg Trp Met Ala Trp Glu Cys He Leu Met Gly Lys Phe 
815 820 825 
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AOG ACT GCG AGT GAC GTG TOO GCC TTT GOT GTG ACC CTG TGG GAG GTG 
Thr Thr Ala Ser Asp Val Trp Ala Phe Gly Val Thr Leu Trp Glu Val 
830 835 840 

CTG ATG CTC TGT AGG GCC CAG CCC TTT GGG CAG CTC ACC GAC GAG CAG 
Leu Met Leu Cys Arg Ala Gin Pro Phe Gly Gin Leu Thr Asp Glu Gin 
845 850 855 

GTC ATC GAG AAC GCG GGG GAG TTC TTC CGG GAC CAG GGC CGG CAG GTG 
Val He Glu Asn Ala Gly Glu Phe Phe Arg Asp Gin Gly Arg Gin Val 
860 865 870 

TAC CTG TCC CGG CCG CCT GCC TGC CCG CAG GGC CTA TAT GAG CTG ATG 
Tyr Leu Ser Arg Pro Pro Ala Cys Pro Gin Gly Leu Tyr Glu Leu Met 
875 880 8 85 8 90 

CTT CGG TGC TGG AGC CGG GAG TCT GAG CAG CGA CCA CCC TTT TCC CAG 
Leu Arg Cys Trp Ser Arg Glu Ser Glu Gin Arg Pro Pro Phe Ser Gin 
895 900 905 

CTG CAT CGG TTC CTG GCA GAG GAT GCA CTC AAC ACG GTG TGAATCACAC 
Leu His Arg Phe Leu Ala Glu Asp Ala Leu Asn Thr Val 
910 915 

ATCCAGCTGC CCCTCCCTCA GGGAGTGATC CAGGGGAAGC CAGTGACACT AAAACAAGAG 
GACACAATGG CACCTCTGCC CTTCCCCTCC CGACAGCCCA TCACCTCTAA TAGAGGCAGT 
GAGACTGCAG GTGGGCTGGG CCCACCCAGG GAGCTGATGC CCCTTCTCCC CTTCCTGGAC 
ACACTCTCAT GTCCCCTTCC TGTTCTTCCT TCCTAGAAGC CCCTGTCGCC CACCCAGCTG 
GTCCTGTGGA TGGGATCCTC TCCACCCTCC TCTAGCCATC CCTTGGGGAA GGGTGGGGAG 
AAATATAGGA TAGACACTGG ACATGGCCCA TTGGAGCACC TGGGCCCCAC TGGACAACAC 
TGATTCCTGG AGAGGTGGCT GCGCCCCAGC TTCTCTCTCC CTGTCACACA CTGGACCCCA 
CTGGCTGAGA ATCTGGGGGT GAGGAGGACA AGAAGGAGAG GAAAATGTTT CCTTGTGCCT 
GCTCCTGTAC TTGTCCTCAG CTTGGGCTTC TTCCTCCTCC ATCACCTGAA ACACTGGACC 
TGGGGGTAGC CCCGCCCCAG CCCTCAGTCA CCCCCACTTC CCACTTGCAG TCTTGTAGCT 
AGAACTTCTC TAAGCCTATA CGTTTCTGTG GAGTAAATAT TGGGATTGGG GGGAAAGAGG 
GAGCAACGGC CCATAGCCTT GGGGTTGGAC ATCTCTAGTG TAGCTGCCAC ATTGATTTTT 
CTATAATCAC TTGGGGTTTG TACATTTTTG GGGGGAGAGA CACAGATTTT TACACTAATA 
TATGGACCTA GCTTGAGGCA ATTTTAATCC CCTGCACTAG GCAGGTAATA ATAAAGGTTG 
AGTTTTCCAC AAAAAAAAAA AAAAAACCGG AATTC 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 



2846 

i 

2894 

2942 

2990 

3038 

3087 

3147 
3207 
3267 
3327 
3387 
3447 
3507 
3567 

3627 

3687 

3747 

3807 

3867 

3927 

3962 
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(A) LENGTH: 919 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Gly Pro Glu Ala Leu Ser Ser Leu Leu Leu Leu Leu Leu Val Ala 
1 5 10 15 

Ser Gly Asp Ala Asp Met Lys Gly His Phe Asp Pro Ala Lys Cys Arg 
20 25 30 

Tyr Ala Leu Gly Met Gin Asp Arg Thr He Pro Asp Ser Asp He Ser 
35 40 45 

Ala Ser Ser Ser Trp Ser Asp Ser Thr Ala Ala Arg His Ser Arcr Leu 
50 55 60 

Glu Ser Ser Asp Gly Asp Gly Ala Trp Cys Pro Ala Gly Ser Val Phe 
65 70 75 80 

Pro Lys Glu Glu Glu Tyr Leu Gin Val Asp Leu Gin Arg Leu His Leu 
85 90 95 

Val Ala Leu Val Gly Thr Gin Gly Arg His Ala Gly Gly Leu Gly Lys 
100 105 HO 

Glu Phe Ser Arg Ser Tyr Arg Leu Arg Tyr Ser Arg Asp Gly Arg Arg 
115 120 125 

Trp Met Gly Trp Lys Asp Arg Trp Gly Gin Glu Val He Ser Gly Asn 
130 135 140 

Glu Asp Pro Glu Gly Val Val Leu Lys Asp Leu Gly Pro Pro Met Val 
145 150 155 leo 

Ala Arg Leu Val Arg Phe Tyr Pro Arg Ala Asp Arg Val Met Ser Val 
165 170 175 

Cys Leu Arg Val Glu Leu Tyr Gly Cys Leu Trp Arg Asp Gly Leu Leu 
180 . 185 190 

Ser Tyr Thr Ala Pro Val Gly Gin Thr Met Tyr Leu Ser Glu Ala Val 
195 200 205 

Tyr Leu Asn Asp Ser Thr Tyr Asp Gly His Thr Val Gly Gly Leu Gin 
210 215 220 

Tyr Gly Gly Leu Gly Gin Leu Ala Asp Gly Val Val Gly Leu Asp Asp 
225 230 235 240 

Phe Arg Lys Ser Gin Glu Leu Arg Val Trp Pro Gly Tyr Asp Tyr Val 
245 250 255 

Gly Trp Ser Asn His Ser Phe Ser Ser Gly Tyr Val Glu Met Glu Phe 
260 265 270 



-58- 



Glu Phe Asp Arg Leu Arg Ala Phe Gin Ala Met Gin Val His Cys Asn 
275 280 285 

Asn Met His Thr Leu Gly Ala Arg Leu Pro Gly Gly Val Glu Cys Arg 
290 295 300 

Phe Arg Arg Gly Pro Ala Met Ala Trp Glu Gly Glu Pro Met Arg His 
305 310 315 320 

Asn Leu Gly Gly Asn Leu Gly Asp Pro Arg Ala Arg Ala Val Ser Val 
325 330 335 

Pro Leu Gly Gly Arg Val Ala Arg Phe Leu Gin Cys Arg Phe Leu Phe 
340 345 350 

Ala Gly Pro Trp Leu Leu Phe Ser Glu He Ser Phe He Ser Asp Val 
355 360 365 

Val Asn Asn Ser Ser Pro Ala Leu Gly Gly Thr Phe Pro Pro Ala Pro 
370 375 380 

Trp Trp Pro Pro Gly Pro Pro Pro Thr Asn Phe Ser Ser Leu Glu Leu 
385 390 395 400 

Glu Pro Arg Gly Gin Gin Pro Val Ala Lys Ala Glu Gly Ser Pro Thr 
405 410 -415 

Ala He Leu He Gly Cys Leu Val Ala He He Leu Leu Leu Leu Leu 
420 425 430 

He He Ala Leu Met Leu Trp Arg Leu His Trp Arg Arg Leu Leu Ser 
435 440 445 

Lys Ala Glu Arg Arg Val Leu Glu Glu Glu Leu Thr Val His Leu Ser 
450 455 460 

Val Pro Gly Asp Thr He Leu He Asn Asn Arg Pro Gly Pro Arg Glu 
465 470 475 480 

Pro Pro Pro Tyr Gin Glu Pro Arg Pro Arg Gly A^n Pro Pro His Ser 
485 490 495 

Ala Pro Cys Val Pro Asn Gly Ser Ala Leu Leu Leu Ser Asn Pro Ala 
500 505 510 

Tyr Arg Leu Leu Leu Ala Thr Tyr Ala Arg Pro Pro Arg Gly Pro Gly 
515 520 525 

Pro Pro Thr Pro Ala Trp Ala Lys Pro Thr Asn Thr Gin Ala Tyr Ser 
530 535 540 

Gly Asp Tyr Met Glu Pro Glu Lys Pro Gly Ala Pro Leu Leu Pro Pro 
545 550 555 560 

Pro Pro Gin Asn Ser Val Pro His Tyr Ala Glu Ala Asp He Val Thr 
565 570 575 
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Leu Gin Gly Val Thr Gly Gly Asn Thr Tyr Ala Val Pro Ala Leu Pro 
580 585 590 

Pro Gly Ala Val Gly Asp Gly Pro Pro Arg Val Asp Phe Pro Arg Ser 
595 600 605 

Arg Leu Arg Phe Lys Glu Lys Leu Gly Glu Gly Gin Phe Gly Glu Val 
61 ° 615 620 

His Leu Cys Glu Val Asp Ser Pro Gin Asp Leu Val Ser Leu Asp Phe 
625 630 6 35 6 40 

Pro Leu Asn Val Arg Lys Gly His Pro Leu Leu Val Ala Val Lys He 
645 650 655 

Leu Arg Pro Asp Ala Thr Lys Asn Ala Ser Phe Ser Leu Phe Ser Arg 
660 665 670 

Asn Asp Phe Leu Lys Glu Val Lys He Met Ser Arg Leu Lys Asp Pro 
67S 680 685 

Asn He He Arg Leu Leu Gly Val Cys Val Gin Asp Asp Pro Leu Cys 
69 ° 695 700 

Met He Thr Asp Tyr Met Glu Asn Gly Asp Leu Asn Gin Phe Leu Ser 
705 7 1° 715 720 

Ala His Gin Leu Glu Asp Lys Ala Ala Glu Gly Ala Pro Gly Asp Gly 
725 730 735 

Gin Ala Ala Gin Gly Pro Thr He Ser Tyr Pro Met Leu Leu His Val 
740 745 750 

Ala Ala Gin He Ala Ser Gly Met Arg Tyr Leu Ala Thr Leu Asn Phe 
755 760 765 

Val His Arg Asp Leu Ala Thr Arg Asn Cys Leu Val Gly Glu Asn Phe 
770 775 780 

Thr He Lys He Ala Asp Phe Gly Met Ser Arg Asn Leu Tyr Ala Glv 
785 7 90 795 soo 

Asp Tyr Tyr Arg Val Gin Gly Arg Ala Val Leu Pro He Arg Trp Met 
805 810 815 

Ala Trp Glu Cys He Leu Met Gly Lys Phe Thr Thr Ala Ser Asp Val 
820 825 830 

Trp Ala Phe Gly Val Thr Leu Trp Glu Val Leu Met Leu Cys Arg Ala 
835 840 845 

Gin Pro Phe Gly Gin Leu Thr Asp Glu Gin Val He Glu Asn Ala Gly 
850 855 860 

Glu Phe Phe Arg Asp Gin Gly Arg Gin Val Tyr Leu Ser Arg Pro Pro 
865 870 875 880 
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Ala Cys Pro Gin Gly Leu Tyr Glu Leu Met Leu Arg Cys Trp Ser Arg 
885 890 895 

Glu Ser Glu Gin Arg Pro Pro Phe Ser Gin Leu His Arg Phe Leu Ala 
900 905 910 

Glu Asp Ala Leu Asn Thr Val 
915 



(2) INFORMATION FOR SEQ ID NO : 3 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 



(iv) ANTI- SENSE: NO 



|p (ix) FEATURE: 

jl (A) NAME/KEY: CDS 

(B) LOCATION: 370.. 2934 



m 

Si*? 



Hi 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

GCACGAGCGG CACGAGTCCA TGATCTCTTT CCATCCTCCC TTTCCTGTTT GCTCACTTCT 

TTTCTTGCTC ATCTTGGAGA CTGTGCAATC CCAGATTAAC TACAAACAGA GAAGAGCTGG 

TGATAGCTCC AGAGCTCAGA GAAAGGAGGT CTCTTTACAA GAAGTCTGGC TCTCAAAGCC 

TCCATCAAGG GAGACCTACA AGTTGCCTGG GGTTCAGTGC TCTAGAAAGT TCCAAGGTTT 

GTGGCTTGAA TTATTCTAAA GAAGCTGAAA TAATTGAAGA G^AAGCAGAGG CCAGCTGTTT 

TTGAGGATCC TGCTCCACAG AGAATGCTCT GCACCCGTTG ATACTCCAGT TCCAACACCA 

TCTTCTGAG ATG ATC CTG ATT CCC AGA ATG CTC TTG GTG CTG TTC CTG 
Met He Leu He Pro Arg Met Leu Leu Val Leu Phe Leu 
1 5 10 

CTG CTG CCT ATC TTG AGT TCT GCA AAA GCT CAG GTT AAT CCA GCT ATA 
Leu Leu Pro He Leu Ser Ser Ala Lys Ala Gin Val Asn Pro Ala He 
15 20 25 

TGC CGC TAT CCT CTG GGC ATG TCA GGA GGC CAG ATT CCA GAT GAG GAC 
Cys Arg Tyr Pro Leu Gly Met Ser Gly Gly Gin He Pro Asp Glu Asp 
30 35 40 4 5 

ATC ACA GCT TCC AGT CAG TGG TCA GAG TCC ACA GCT GCC AAA TAT GGA 
He Thr Ala Ser Ser Gin Trp Ser Glu Ser Thr Ala Ala Lys Tyr Gly 
50 55 60 



60 
120 
180 
240 
300 
360 
408 

456 

504 

552 
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AGG CTG GAC TCA GAA GAA GGG GAT GGA GCC TGG TGC CCT GAG ATT CCA 600 
Arg Leu Asp Ser Glu Glu Gly Asp Gly Ala Trp Cys Pro Glu He Pro 
65 70 75 

GTG GAA CCT GAT GAC CTG AAG GAG TTT CTG CAG ATT GAC TTG CAC ACC 648 
Val Glu Pro Asp Asp Leu Lys Glu Phe Leu Gin He Asp Leu His Thr 
80 85 90 

CTC CAT TTT ATC ACT CTG GTG GGG ACC CAG GGG CGC CAT GCA GGA GGT 6 96 

Leu His Phe lie Thr Leu Val Gly Thr Gin Gly Arg His Ala Gly Gly 
95 100 105 

CAT GGC ATC GAG TTT GCC CCC ATG TAC AAG ATC AAT TAC AGT CGG GAT 744 
His Gly He Glu Phe Ala Pro Met Tyr Lys He Asn Tyr Ser Arg Asp 
110 115 120 125 

GGC ACT CGC TGG ATC TCT TGG CGG AAC CGT CAT GGG AAA CAG GTG CTG 7 92 

Gly Thr Arg Trp He Ser Trp Arg Asn Arg His Gly Lys Gin Val Leu 
130 135 140 

GAT GGA AAT AGT AAC CCC TAT GAC ATT TTC CTA AAG GAC TTG GAG CCG 84 0 

Asp Gly Asn Ser Asn Pro Tyr Asp He Phe Leu Lys Asp Leu Glu Pro 
145 150 155 

m CCC ATT GTA GCC AGA TTT GTC CGG TTC ATT CCA GTC ACC GAC CAC TCC 88 8 

IJA Pro He Val Ala Arg Phe Val Arg Phe He Pro Val Thr Asp His Ser 
111 160 165 170 

| lte L ATG AAT GTG TGT ATG AGA GTG GAG CTT TAC GGC TGT GTC TGG CTA GAT 936 

CO Met Asn Val CyS Met Arg Val Glu Leu T Y r Gl y °Y S Val Tr P Leu Asp 

j'« 175 180 185 

GGC TTG GTG TCT TAC AAT GCT CCA GCT GGG CAG CAG TTT GTA CTC CCT 984 
Gly Leu Val Ser Tyr Asn Ala Pro Ala Gly Gin Gin Phe Val Leu Pro 
190 195 200 205 

H GGA GGT TCC ATC ATT TAT CTG AAT GAT TCT GTC TAT GAT GGA GCT GTT 103 2 

O G1 Y G1 Y s er He He Tyr Leu Asn Asp Ser Val Tyr Asp Gly Ala Val 
O 210 215 220 

GGA TAC AGC ATG ACA GAA GGG CTA GGC CAA TTG ACC GAT GGT GTG TCT 1080 
Gly Tyr Ser Met Thr Glu Gly Leu Gly Gin Leu Thr Asp Gly Val Ser 
225 230 235 

GGC CTG GAC GAT TTC ACC CAG ACC CAT GAA TAC CAC GTG TGG CCC GGC 112 8 

Gly Leu Asp Asp Phe Thr Gin Thr His Glu Tyr His Val Trp Pro Gly 
240 245 250 

TAT GAC TAT GTG GGC TGG CGG AAC GAG AGT GCC ACC AAT GGC TAC ATT 1176 
Tyr Asp Tyr Val Gly Trp Arg Asn Glu Ser Ala Thr Asn Gly Tyr He 
255 260 265 

GAG ATC ATG TTT GAA TTT GAC CGC ATC AGG AAT TTC ACT ACC ATG AAG 1224 
Glu He Met Phe Glu Phe Asp Arg He Arg Asn Phe Thr Thr Met Lys 
270 275 280 285 
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m 

W 



GTC CAC TGC AAC AAC ATG TTT GOT AAA GGT GTG AAG ATC TTT AAG GAG 1272 
Val His Cys Asn Asn Met Phe Ala Lys Gly Val Lys He Phe Lys Glu 
290 295 300 

GTA CAG TGC TAC TTC CGC TCT GAA GCC AGT GAG TGG GAA CCT AAT GCC 
Val Gin Cys Tyr Phe Arg Ser Glu Ala Ser Glu Trp Glu Pro Asn Ala 
305 310 315 



ATT TCC TTC CCC CTT GTC CTG GAT GAC GTC AAC CCC AGT GCT CGG TTT 
He Ser Phe Pro Leu Val Leu Asp Asp Val Asn Pro Ser Ala Arg Phe 
320 325 330 

GTC ACG GTG CCT CTC CAC CAC CGA ATG GCC AGT GCC ATC AAG TGT CAA 
Val Thr Val Pro Leu His His Arg Met Ala Ser Ala He Lys Cys Gin 
335 340 345 

TAC CAT TTT GCA GAT ACC TGG ATG ATG TTC AGT GAG ATC ACC TTC CAA 
Tyr His Phe Ala Asp Thr Trp Met Met Phe Ser Glu He Thr Phe Gin 
350 355 360 365 

TCA GAT GCT GCA ATG TAC AAC AAC TCT GAA GCC CTG CCC ACC TCT CCT 
Ser Asp Ala Ala Met Tyr Asn Asn Ser Glu Ala Leu Pro Thr Ser Pro 
370 375 380 

m ATG GCA CCC ACA ACC TAT GAT CCA ATG CTT AAA GTT GAT GAC AGC AAC 

ly Met Ala Pro Thr Thr Tyr Asp Pro Met Leu Lys Val Asp Asp Ser Asn 

lR 385 390 395 



1320 



1368 



1416 



1464 



1512 



1560 



ACT CGG ATC CTG ATT GGC TGC TTG GTG GCC ATC ATC TTT ATC CTC CTG 
Thr Arg He Leu He Gly Cys Leu Val Ala He He Phe He Leu Leu 
400 405 410 

GCC ATC ATT GTC ATC ATC CTC TGG AGG CAG TTC TGG CAG AAA ATG CTG 
Ala He He Val He He Leu Trp Arg Gin Phe Trp Gin Lys Met Leu 
415 420 425 

GAG AAG GCT TCT CGG AGG ATG CTG GAT GAT GAA ATG ACA GTC AGC CTT 
Glu Lys Ala Ser Arg Arg Met Leu Asp Asp Glu Met Thr Val Ser Leu 
430 435 440 445 

TCC CTG CCA AGT GAT TCT AGC ATG TTC AAC AAT AAC CGC TCC TCA TCA 
Ser Leu Pro Ser Asp Ser Ser Met Phe Asn Asn Asn Arg Ser Ser Ser 
450 455 460 

CCT AGT GAA CAA GGG TCC AAC TCG ACT TAC GAT CGC ATC TTT CCC CTT 
Pro Ser Glu Gin Gly Ser Asn Ser Thr Tyr Asp Arg He Phe Pro Leu 
465 470 475 

CGC CCT GAC TAC CAG GAG CCA TCC AGG CTG ATA CGA AAA CTC CCA GAA 
Arg Pro Asp Tyr Gin Glu Pro Ser Arg Leu He Arg Lys Leu Pro Glu 
480 485 490 

TTT GCT CCA GGG GAG GAG GAG TCA GGC TGC AGC GGT GTT GTG AAG CCA 1896 
Phe Ala Pro Gly Glu Glu Glu Ser Gly Cys Ser Gly Val Val Lys Pro 
495 500 505 



1608 



1656 



1704 



1752 



1800 



1848 
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GTC CAG CCC AGT GGC CCT GAG GGG GTG CCC CAC TAT GCA GAG GCT GAC 1944 
Val Gin Pro Ser Gly Pro Glu Gly Val Pro His Tyr Ala Glu Ala Asp 
510 515 520 525 

ATA GTG AAC CTC CAA GGA GTG ACA GGA GGC AAC ACA TAC TCA GTG CCT 
He Val Asn Leu Gin Gly Val Thr Gly Gly Asn Thr Tyr Ser Val Pro 
530 535 540 



GAC AAA GAT TTT GCC CTA GAT GTC AGT GCC AAC CAG CCT GTC CTG GTG 
Asp Lys Asp Phe Ala Leu Asp Val Ser Ala Asn Gin Pro Val Leu Val 
590 595 600 605 

h § GCT GTG AAA ATG CTC CGA GCA GAT GCC AAC AAG AAT GCC AGG AAT GAT 

|U Ala Val Lys Met Leu Arg Ala Asp Ala Asn Lys Asn Ala Arg Asn Asp 

If! 610 615 620 



1992 



2040 



2088 



GCC GTC ACC ATG GAC CTG CTC TCA GGA AAA GAT GTG GCT GTG GAG GAG 
Ala Val Thr Met Asp Leu Leu Ser Gly Lys Asp Val Ala Val Glu Glu 
545 550 555 

TTC CCC AGG AAA CTC CTA ACT TTC AAA GAG AAG CTG GGA GAA GGA CAG 
Phe Pro Arg Lys Leu Leu Thr Phe Lys Glu Lys Leu Gly Glu Gly Gin 
560 565 570 

TTT GGG GAG GTT CAT CTC TGT GAA GTG GAG GGA ATG GAA AAA TTC AAA 213 6 

Phe Gly Glu Val His Leu Cys Glu Val Glu Gly Met Glu Lys Phe Lys 
575 580 585 



2184 



2232 



. TTT CTT AAG GAG ATA AAG ATC ATG TCT CGG CTC AAG GAC CCA AAC ATC 2280 

i'lPs Phe Leu L y s Glu Lys He Met Ser Arg Leu Lys Asp Pro Asn He 

m 625 630 635 

ATC CAT CTA TTA TCT GTG TGT ATC ACT GAT GAC CCT CTC TGT ATG ATC 2 32 8 

U He His Leu Leu Ser Val Cys He Thr Asp Asp Pro Leu Cys Met He 

4p 64 0 645 650 



ACT GAA TAC ATG GAG AAT GGA GAT CTC AAT CAG TTT CTT TCC CGC CAC 
Thr Glu Tyr Met Glu Asn Gly Asp Leu Asn Gin Phe Leu Ser Arg His 
65 5 660 665 

GAG CCC CCT AAT TCT TCC TCC AGC GAT GTA CGC ACT GTC AGT TAC ACC 
Glu Pro Pro Asn Ser Ser Ser Ser Asp Val Arg Thr Val Ser Tyr Thr 
670 675 680 685 

AAT CTG AAG TTT ATG GCT ACC CAA ATT GCC TCT GGC ATG AAG TAC CTT 
Asn Leu Lys Phe Met Ala Thr Gin He Ala Ser Gly Met Lys Tyr Leu 
690 695 700 

TCC TCT CTT AAT TTT GTT CAC CGA GAT CTG GCC ACA CGA AAC TGT TTA 
Ser Ser Leu Asn Phe Val His Arg Asp Leu Ala Thr Arg Asn Cys Leu 
705 710 715 

GTG GGT AAG AAC TAC ACA ATC AAG ATA GCT GAC TTT GGA ATG AGC AGG 2568 
Val Gly Lys Asn Tyr Thr He Lys He Ala Asp Phe Gly Met Ser Arg 
720 725 730 



2376 



2424 



2472 



2520 
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AAC CTG TAC AGT GOT GAC TAT TAC CGG ATC CAG GGC CGG GCA GTG CTC 2616 
Asn Leu Tyr Ser Gly Asp Tyr Tyr Arg He Gin Gly Arg Ala Val Leu 
735 740 745 

CCT ATC CGC TGG ATG TCT TGG GAG AGT ATC TTG CTG GGC AAG TTC ACT 
Pro He Arg Trp Met Ser Trp Glu Ser He Leu Leu Gly Lys Phe Thr 
750 7 " 760 r 765 

ACA GCA AGT GAT GTG TGG GCC TTT GGG GTT ACT TTG TGG GAG ACT TTC 
Thr Ala Ser Asp Val Trp Ala Phe Gly Val Thr Leu Trp Glu Thr Phe 
770 775 780 

ACC TTT TGT CAA GAA CAG CCC TAT TCC CAG CTG TCA GAT GAA CAG GTT 
Thr Phe Cys Gin Glu Gin Pro Tyr Ser Gin Leu Ser Asp Glu Gin Val 
7 85 790 795 

ATT GAG AAT ACT GGA GAG TTC TTC CGA GAC CAA GGG AGG CAG ACT TAC 
He Glu Asn Thr Gly Glu Phe Phe Arg Asp Gin Gly Arg Gin Thr Tvr 
80 ° 805 810 

CTC CCT CAA CCA GCC ATT TGT CCT GAC TCT GTG TAT AAG CTG ATG CTC 
Leu Pro Gin Pro Ala He Cys Pro Asp Ser Val Tyr Lys Leu Met Leu 
815 820 825 

O 

,£ AGC TGC TGG AGA AGA GAT ACG AAG AAC CGT CCC TCA TTC CAA GAA ATC 2 904 

yl Ser Cys Trp Arg Arg Asp Thr Lys Asn Arg Pro Ser Phe Gin Glu He 

ID 830 835 840 845 

S "" CAC CTT CTG CTC CTT CAA CAA GGC GAC GAG TGATGCTGTC AGTGCCTGGC 2954 

His Leu Leu Leu Leu Gin Gin Gly Asp Glu 
850 855 

CATGTTCCTA CGGCTCAGGT CCTCCCTACA AGACCTACCA CTCACCCATG CCTATGCCAC 3014 

TCCATCTGGA CATTTAATGA AACTGAGAGA CAGAGGCTTG TTTGCTTTGC CCTCTTTTCC 3074 

TGGTCACCCC CACTCCCTAC CCCTGACTCA TATATACTTT TTTTTTTTAC ATTAAAGAAC 3134 

TAAAAAAAAA AAAAAAAAAG GCG 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 855 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met lie Leu He Pro Arg Met Leu Leu Val Leu Phe Leu Leu Leu Pro 
1 . 5 10 15 

He Leu Ser Ser Ala Lys Ala Gin Val Asn Pro Ala He Cys Arg Tyr 
20 25 30 



2664 



2712 



2760 



2808 



2856 



3157 
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Pro Leu Gly Met Ser Gly Gly Gin lie Pro Asp Glu Asp lie Thr Ala 
35 40 45 

Ser Ser Gin Trp Ser Glu Ser Thr Ala Ala Lys Tyr Gly Arg Leu Asp 
50 55 60 

Ser Glu Glu Gly Asp Gly Ala Trp Cys Pro Glu lie Pro Val Glu Pro 
65 70 75 80 

Asp Asp Leu Lys Glu Phe Leu Gin lie Asp Leu His Thr Leu His Phe 
85 90 95 

lie Thr Leu Val Gly Thr Gin Gly Arg His Ala Gly Gly His Gly lie 
100 105 110 

Glu Phe Ala Pro Met Tyr Lys lie Asn Tyr Ser Arg Asp Gly Thr Arg 
115 120 125 

Trp He Ser Trp Arg Asn Arg His Gly Lys Gin Val Leu Asp Gly Asn 
130 135 140 

Ser Asn Pro Tyr Asp He Phe Leu Lys Asp Leu Glu Pro Pro lie Val 
145 150 155 160 

Ala Arg Phe Val Arg Phe He Pro Val Thr Asp His Ser Met Asn Val 
165 170 175 

Cys Met Arg Val Glu Leu Tyr Gly Cys Val Trp Leu Asp Gly Leu Val 
180 185 190 

Ser Tyr Asn Ala Pro Ala Gly Gin Gin Phe Val Leu Pro Gly Gly Ser 
195 200 205 

He He Tyr Leu Asn Asp Ser Val Tyr Asp Gly Ala Val Gly Tyr Ser 
210 215 220 

Met Thr Glu Gly Leu Gly Gin Leu Thr Asp Gly Val Ser Gly Leu Asp 
225 230 235 240 

Asp Phe Thr Gin Thr His Glu Tyr His Val Trp Pro Gly Tyr Asp Tyr 
245 250 255 

Val Gly Trp Arg Asn Glu Ser Ala Thr Asn Gly Tyr He Glu He Met 
2 60 265 270 

Phe Glu Phe Asp Arg He Arg Asn Phe Thr Thr Met Lys Val His Cys 
275 280 285 

Asn Asn Met Phe Ala Lys Gly Val Lys He Phe Lys Glu Val Gin Cys 
290 295 300 

Tyr Phe Arg Ser Glu Ala Ser Glu Trp Glu Pro Asn Ala He Ser Phe 
305 310 315 320 

Pro Leu Val Leu Asp Asp Val Asn Pro Ser Ala Arg Phe Val Thr Val 
325 330 335 
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Pro Leu His His Arg Met Ala Ser Ala He Lys Cys Gin Tyr His Phe 
340 345 350 

Ala Asp Thr Trp Met Met Phe Ser Glu He Thr Phe Gin Ser Asp Ala 
355 360 365 

Ala Met Tyr Asn Asn Ser Glu Ala Leu Pro Thr Ser Pro Met Ala Pro 
370 375 380 

Thr Thr Tyr Asp Pro Met Leu Lys Val Asp Asp Ser Asn Thr Arg He 
385 390 395 400 

Leu He Gly Cys Leu Val Ala He He Phe He Leu Leu Ala He He 
405 410 415 

Val He He Leu Trp Arg Gin Phe Trp Gin Lys Met Leu Glu Lys Ala 
420 425 430 

Ser Arg Arg Met Leu Asp Asp Glu Met Thr Val Ser Leu Ser Leu Pro 
435 440 445 

Ser Asp Ser Ser Met Phe Asn Asn Asn Arg Ser Ser Ser Pro Ser Glu 
450 455 460 

Gin Gly Ser Asn Ser Thr Tyr Asp Arg He Phe Pro Leu Arg Pro Asp 
465 470 475 480 

Tyr Gin Glu Pro Ser Arg Leu He Arg Lys Leu Pro Glu Phe Ala Pro 
485 490 495 

Gly Glu Glu Glu Ser Gly Cys Ser Gly Val Val Lys Pro Val Gin Pro 
500 505 510 

Ser Gly Pro Glu Gly Val Pro His Tyr Ala Glu Ala Asp He Val Asn 
515 520 525 

Leu Gin Gly Val Thr Gly Gly Asn Thr Tyr Ser Val Pro Ala Val Thr 
530 535 540 

Met Asp Leu Leu Ser Gly Lys Asp Val Ala Val Glu Glu Phe Pro Arg 
545 550 555 560 

Lys Leu Leu Thr Phe Lys Glu Lys Leu Gly Glu Gly Gin Phe Gly Glu 
565 570 575 

Val His Leu Cys Glu Val Glu Gly Met Glu Lys Phe Lys Asp Lys Asp 
580 585 590 

Phe Ala Leu Asp Val Ser Ala Asn Gin Pro Val Leu Val Ala Val Lys 
595 600 605 

Met Leu Arg Ala Asp Ala Asn Lys Asn Ala Arg Asn Asp Phe Leu Lys 
61 <> 615 620 

Glu He Lys He Met Ser Arg Leu Lys Asp Pro Asn He He His Leu 
625 630 6 35 6 40 



-67- 



Leu Ser Val Cys He Thr Asp Asp Pro Leu Cys Met He Thr Glu Tyr 
64 5 650 655 

Met Glu Asn Gly Asp Leu Asn Gin Phe Leu Ser Arg His Glu Pro Pro 
660 665 670 

Asn Ser Ser Ser Ser Asp Val Arg Thr Val Ser Tyr Thr Asn Leu Lys 
675 680 685 

Phe Met Ala Thr Gin He Ala Ser Gly Met Lys Tyr Leu Ser Ser Leu 
690 695 700 

Asn Phe Val His Arg Asp Leu Ala Thr Arg Asn Cys Leu Val Gly Lys 
705 7 10 715 720 

Asn Tyr Thr He Lys He Ala Asp Phe Gly Met Ser Arg Asn Leu Tyr 
725 730 735 

Ser Gly Asp Tyr Tyr Arg He Gin Gly Arg Ala Val Leu Pro He Arg 
740 745 750 

Trp Met Ser Trp Glu Ser He Leu Leu Gly Lys Phe Thr Thr Ala Ser 
755 760 765 

Asp Val Trp Ala Phe Gly Val Thr Leu Trp Glu Thr Phe Thr Phe Cys 
770 775 780 

Gin Glu Gin Pro Tyr Ser Gin Leu Ser Asp Glu Gin Val He Glu Asn 
785 790 795 800 

Thr Gly Glu Phe Phe Arg Asp Gin Gly Arg Gin Thr Tyr Leu Pro Gin 
805 810 815 

Pro Ala He Cys Pro Asp Ser Val Tyr Lys Leu Met Leu Ser Cys Trp 
820 8 25 830 

Arg Arg Asp Thr Lys Asn Arg Pro Ser Phe Gin Glu He His Leu Leu 
835 840 845 

Leu Leu Gin Gin Gly Asp Glu 
850 855 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /note- "Ala can be enchanged for 
any amino acid" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



Asn Pro Ala Tyr 
1 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Thr Tyr Ala Xaa Pro Xaa Xaa Xaa Pro Gly 
1 5 io 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

His Arg Asp Leu Ala Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 
GGAATTCCCA YMGNRAYYTN RCNRCNMG 
(2) INFORMATION FOR SEQ ID NO : 9 : 



69 



in 



V.: J 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY: Modified- site 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /note- "Xaa can be either Phe or 

Tyr " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Ser Asp Val Trp Ser Xaa 
1 5 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
3 (A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
|^ GGAATTCCYW YNSWGGTNTG SAGNST 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

His Phe Asp Pro Ala Lys Asp Cys Arg Tyr Ala Leu Gly Met Gin Asp 
15 10 15 

Arg Thr lie 
(2) INFORMATION FOR SEQ ID NO: 12: 
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26 



3 



4 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Arg Pro Pro Phe Ser Gin Leu His Arg Phe Leu Ala Glu Asp Ala Leu 
Asn Thr Val 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
Pro Ala Met Ala Trp Glu Gly Glu Pro Met Arg His Asn Leu 

(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



10 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

Cys Trp Ser Arg Glu Ser Glu Gin Arg Pro Pro Phe Ser Gin Leu His 
1 5 10 15 

Arg 
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